Speech intelligibility derived from asynchronous processing of auditory-visual information
نویسندگان
چکیده
The current study examines the temporal parameters associated with cross-modal integration of auditory-visual information for sentential material (Harvard/IEEE sentences). The speech signal was filtered into 1/3-octave channels, all of which were discarded (in the primary experiment) save for a low-frequency (298-375 Hz) and a high-frequency (4762-6000 Hz) band. The intelligibility of this audio-only signal ranged (depending on the listener) between 9% and 31% (of the key words correct) for nine normal-hearing subjects. Visual-alone presentation of the same material ranged between 1% and 22% intelligi-bility. When the audio and video signals are combined and presented in perfect synchrony, intelligibility climbs to an average of 63%. The audio and video signals were systematically desynchronized in symmetrical fashion up to a maximum onset asynchrony of 400 ms. When the audio signal leads the video, intelligibility declines appreciably for even the shortest asynchrony of 40 ms, falling to an asymptotic level of performance for asynchronies of ca. 120 ms and longer for most subjects. In contrast, when the video signal leads the audio, intelligibility remains relatively stable for onset asynchronies up to 160-200 ms, and in some instances may actually improve. Hence, there is a marked asymmetry in the integration of audio and visual information that has important implications for sensory-based models of auditory-visual speech processing.
منابع مشابه
Visual speech improves the intelligibility of time-expanded auditory speech.
This study investigated the effects of intermodal timing differences and speed differences on word intelligibility of auditory-visual speech. Words were presented under visual-only, auditory-only, and auditory-visual conditions. Two types of auditory-visual conditions were used: asynchronous and expansion conditions. In the asynchronous conditions, the audio lag was 0-400 ms. In the expansion c...
متن کاملExplaining the visual and masked-visual advantage in speech perception in noise: the role of visual phonetic cues
Visual enhancement of speech intelligibility, although clearly established, still resists a clear description. We attempt to contribute to solving that problem by proposing a simple account based on phonetically motivated visual cues. This work extends a previous study quantifying the visual advantage in sentence intelligibility across three conditions with varying degrees of visual information...
متن کاملMultimodal Sentence Intelligibility and the Detection of Auditory-Visual Asynchrony in Speech and Nonspeech Signals: A First Report
The ability to perceive and understand visual-only speech and the benefit experienced from having both auditory and visual signals available during speech perception tasks varies widely in the normal-hearing population. At the present time, little is known about the underlying neural mechanisms responsible for this variability or the possible relationships between multisensory speech perception...
متن کاملEffects of intermodal timing difference and speed difference on intelligibility of auditory-visual speech in younger and older adults
Previous studies have revealed a temporal window during which human observers perceive physically desynchronized auditory and visual signals as synchronous. This study investigated effects of intermodal timing differences and speed differences on intelligibility of auditory-visual speech. We used 20 minimal pairs of Japanese four-mora words such as “mi-zu-a-ge” (catch landing) versus “mi-zu-a-m...
متن کامل9 Auditory ‐ Visual Speech Processing Something Doesn ’ t Add Up ErIc VATIkIoTIS ‐
The multimodal production and multisensory perception of speech have received much research attention in the past 60 years since Sumby and Pollack’s landmark demonstration that being able to see a talker’s face in noisy acoustic conditions dramatically improves speech intelligibility (Sumby and Pollack 1954). Myriad studies have pursued various conceptual lines about the production and processi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001